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We study the diffusion of influence in random multiplex networks where links can be of r different 
types, and for a given content (e.g., rumor, product, political view), each link type is associated 
with a content dependent parameter d in [0, oo] that measures the relative bias type-i links have in 
spreading this content. In this setting, we propose a linear threshold model of contagion where nodes 
switch state if their "perceived" proportion of active neighbors exceeds a threshold r. Namely, a node 
connected to rm active neighbors and ki — rm inactive neighbors via type-i links will turn active if 
Y2, CiVai/ ^ Ciki exceeds its threshold r. Under this model, we obtain the condition, probability and 
expected size of 3/0 baZ spreading events. Our results extend the existing work on complex contagions 
in several directions by (i) providing solutions for coupled random networks whose vertices are neither 
identical nor disjoint, (ii) highlighting the effect of content on the dynamics of complex contagions, 
and (iii) showing that content-dependent propagation over a multiplex network leads to a subtle 
relation between the giant vulnerable component of the graph and the global cascade condition that 
is not seen in the existing models in the literature. 

PACS numbers: 89.75.Hc, 87.23. Ge 



I. INTRODUCTION 

In the past decade, there has been an increasing inter- 
est in studying dynamical processes on real-world com- 
plex networks. An interesting phenomenon that seems 
to occur in many such processes is the spreading of an 
initially localized effect throughout the whole (or, a very 
large part of the) network. These events are usually re- 
ferred to as (information) cascades and can be observed 
in processes as diverse as adoption of cultural fads, the 
diffusion of belief, norms, and innovations in social net- 
works [ll- ^ , disease contagion in human and animal pop- 
ulations J, m , failures in interdependent infrastructures 
rise of collective action to join a riot [l5|, and 
glob al spread of computer viruses or worms on the Web 

The current paper focuses on a class of dynamic pro- 
cesses usually known as binary decisions with external- 
ities [ij. In this model, nodes can be in either one of 
the two states: active or inactive. Each node is initially 
given a random threshold r in (0, 1] drawn independently 
from a distribution Pth(T). Then, starting from a small 
number of active vertices, nodes update their states (syn- 
chronously) in discrete time steps. An inactive node with 
m active neighbors and k — m inactive neighbors will be 
activated only if the fraction ^ exceeds r; once active, a 
node can not be deactivated. This model is sometimes re- 
ferred to as the Watts' threshold model. However, it was 
motivated by the seminal work of Schelling [l^ who em- 
ployed a threshold model to g ain insight into residential 
segregation. Granovetter [l^ also studied this model in 
a fully mixed population (i.e, one where each individual 
can affect any other one regardless of network topology) 
to characterize riot behavior. 

The starting point of the current work is the following 
basic observation. Most existing studies on this subject 
are based on the assumption that all the links in the net- 



work are of the same type; i.e., it is assumed that the 
underlying network is simplex. However, in reality, links 
might be classified according to the nature of the rela- 
tionship they represent, and each link type may play a 
different role in different cascade processes. For exam- 
ple, in the spread of a new consumer product amongst 
the population, a video game would be more likely to 
be promoted among high school classmates rather than 
among family members [Toj : the situation would be ex- 
actly the opposite in the case of a new cleaning product. 
Several other examples, both intuitive and real- world, 
can be given to show the relevance of link classification. 
A few of them include belief propagation in a coupled 
social-physical network (links between distant Facebook 
friends vs. links between close office-mates), cascading 
failures in interdependent networks (power links that are 
vulnerable to natural hazards vs. computer links that are 
vulnerable to viruses), and spread of worms or viruses 
over the Internet (worms that spread via E-mail contacts 
vs. worms that exploit particular system vulnerabilities; 
e.g^ , see the Internet (Morris) worm of November 2, 1988 

With this motivation in mind, in this paper we study 
the cascade processes in multiplex networks (2l| - [2^ : As- 
sume that the links in the network are classified into r 
different types 1, . . . , r. For a given content (a product, 
view, rumor, or a source of failure), consider positive 
scalars ci , . . . , , such that Ci quantifies the relative bias 
a type-z link has in spreading this particular content; i.e., 
the larger the constant c^, the more likely it is for the eon- 
tent to spread via type-? links. Then, we assume that an 
inactive node with threshold t gets activated if 



ClTUl + C2TO2 



> T 



(1) 



where to, (resp. ki) is the number of active neighbors 
(resp. number of neighbors) that the node is connected 
via a type-i link. In other words, instead of using the frac- 
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tion m/k of one's active neighbors, we use the content- 
dependent quantity X)i=i ^i^i^ hereafter re- 
ferred to as the perceived proportion of active neighbors 
j25| . This formulation allows a more accurate charac- 
terization of a node's influence on others' behavior with 
respect to spreading of various contents; the original case 
can easily be recovered by setting ci = . . . = Cr ~ 1. 

Under the condition ([T} for adoption, we are inter- 
ested in understanding whether a single node can trig- 
ger a global cascade; i.e., whether a linear fraction of 
nodes (in the asymptotic limit) eventually becomes ac- 
tive when an arbitrary node is switched to the active 
state. For ease of exposition, we consider the case where 
links are classified into two types; extension to r types is 
straightforward. Assuming that each link type defines a 
sub-network which is constructed according to the con- 
figuration model [26j , we find the conditions under which 
a global cascade is possible; the precise definition of the 
model is given in Section |lll In the cases where a global 
spreading event is possible, we find the exact probability 
of its taking place, as well as the final expected cascade 
size. 

These results constitute an extension of the results by 
Watts [l| in several directions: First, our work extends 
the previous results on single networks with arbitrary de- 
gree distribution to multiple overlay networks (27| where 
the vertex sets of the constituent networks are not dis- 
joint (as in modular networks [28|'). Second, by intro- 
ducing the condition ([T]) for adoption, our model is ca- 
pable of capturing the relative effect of content in the 
spread of influence, and our theory highlights how differ- 
ent content may have different spreading characteristics 
over the same network. Third, our analysis indicates that 
content-dependent propagation over a network with clas- 
sified links entails multiple notions of vulnerability (with 
respect to each link type) , resulting in a directed subgraph 
on vulnerable nodes. This leads to a subtle relation be- 
tween the giant vulnerable component of the graph and 
the global cascade condition in a manner different than 
the existing models 

Very recently, Brummitt et al. [ll] also studied the dy- 
namics of cascades in multiplex networks, but under a dif- 
ferent formulation then ours. There, they assumed that 
a node becomes active if the fraction of its active neigh- 
bors in any link type exceeds a certain threshold. With 
the notation introduced so far, this condition amounts to 

max I — ) > r. (2) 

i=l,....r \ ki J 

In setting the authors studied the threshold and the 
size of global cascades and found that multiplex networks 
are more vulnerable to cascades as compared to simplex 
networks. Although formulation ^ might be relevant 
for certain cases, it can not capture the effect of content 
in the cascade process. Furthermore, condition dTJ pro- 
posed here enables more general observations in terms of 
the vulnerability of multiplex networks: Depending on 
the content parameters ci,...,Cr, a multiplex network 



can be more, less, or equally vulnerable to cascades as 
compared to a simplex network with the same total de- 
gree distribution; e.g., see Section |lVl In fact, it always 
holds that 

min ( — < SLi Ci^Ttt ^ / mA 

j=l,...,r \ ki ) ~ Yl=i ~ V ki J 

However, it is worth noting that the results ob- 
tained here do not contain those of [g^] since one can 
not select ci,...,Cr such that '^i'^j/Si=i "^i^i = 

maxi=i^...^,. (mi/ki) holds for all possible {rui, ki}'^^^. 

The paper is structured as follows. In Section [II] we 
give the details of the system model. Analytical re- 
sults regarding the condition, probability, and the size 
of global cascades over the system model are given in 
Section IIIIl while in Section IIVI we present numerical re- 
sults that illustrate the main findings of the paper. We 
close with some remarks in Section |Vl 



II. MODEL DEFINITIONS 

For illustration purposes we give the model definitions 
in the context of an overlay social-physical network. We 
start with a set of individuals in the population repre- 
sented by nodes l,...,n. Let W stand for the physi- 
cal network of individuals that characterizes the possible 
spread of infiuence through reciprocal (i.e., mutual) com- 
munications; a link represents a reciprocal communica- 
tion if there is a message exchange in both directions over 
the link. Examples of reciprocal communications include 
face-to-facc communications, phone calls, chats, or mu- 
tual communications through an online social networking 
website. Next, we let F stand for a network that char- 
acterizes the spread of influence through non-reciprocal 
communications in an online social networking web site, 
e.g., Facebook [s^]. We assume that the physical network 
W is defined on the vertices JV ~ {1, 2, . . . , n} implying 
that each individual in the population is a member of W. 
Considering the fact that not everyone in the population 
is a member of online social networks, we assume that 
the network F is defined on the vertex set Afp where 

V[ieAfF]=a, i = l,...,n, (3) 

In other words, we assume that each node in TV is a 
member of F with probability a £ (0, 1] independently 
from any other node. 

We define the structure of the networks W and F 
through their respective degree distributions {p^} and 
In other words, for each k = 0, 1, . . ., a node in 
W (resp. in F) has degree k with probability p'^ (resp. 
pI). This corresponds to generating both networks (inde- 
pendently) according to the configuration model [SJ, H^] . 
Then, we consider an overlay network H that is con- 
structed by taking the union of W and F. In other words, 
for any distinct pair of nodes i,j, we say that i and j are 
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adjacent in the network H, denoted i j, as long as at 
least one of the conditions {i j} or {i ~f j} holds. 

The overlay network H = W U F constitutes an ensem- 
ble of the colored degree-driven random graphs studied by 
Soderberg jUjUl]. Let {1, 2} be the space of possible col- 
ors (or types) of edges in H; specifically, we let the edges 
in Facebook be of type 1, while the edges in the physical 
network are said to be of type 2. The colored degree of a 
node i is given by an integer vector fc' = (A;^^, k\), where 
k\ (resp. k^) stands for the number of Facebook edges 
(resp. physical connections) that are incident on node i. 
Clearly, the plain degree of a node is given by /c = ki + k2. 
Under the given assumptions on the degree distributions 
of W and F, the colored degrees (i.e., fc^,...,A;") will 
be independent and identically distributed according to 
a colored degree distribution {pk} such that 



Pk 



+ {l~a)l[k,^0])-pl, k = {k,M) (4) 



due to independence of F and W. The term (1 — 
a)l [fci = 0] accommodates the possibility that a node 
is not a member of the online social network, in which 
case the number fci of F-edges is automatically zero. 

Given that the colored degrees are picked such that 
y^7-i k\ and ^2 ^^^^ even, we construct H as in [ssl - 

|37| : Each node i = 1, . . . , n is first given the appropriate 
number k\ and of stubs of type 1 and type 2, respec- 
tively. Then, pairs of these stubs that are of the same 
type are picked randomly and connected together to form 
complete edges; clearly, two stubs can be connected to- 
gether only if they are of the same type. Pairing of stubs 
continues until none is left. 

Now, consider a complex contagion process in the ran- 
dom network H. As stated in the Introduction, we let 
each node i be assigned a binary value a{i) specifying 
its current state, active {(j{i) = 1) or inactive {(j{i) = 0). 
Each node is initially given a random threshold t in (0, 1] 
drawn independently from a distribution Pth(''"). Nodes 
update their states synchronously at times t ~ 0, 1, . . .. 
An inactive node will be activated at time t if, at time 
t — 1, its perceived proportion of active neighbors exceeds 
its threshold r. Namely, for a given content, let ci, C2 
be positive scalars that model the relative importance of 
type-1 and type-2 links, respectively, in spreading this 
content. Then, with k = (^1,^2) denoting its colored 
degree, and m = (mi, m2) denoting its number of active 
neighbors connected through a type-1 and type-2 link at 
time t — 1, respectively, a node will become active (at 
time t) with probability 



cimi + C2m2 

Ciki -f C2fc2 



> T 



;= F{m,k) 



Hereafter, F{m, k) will be referred to as the neighbor- 
hood influence response function 0, . To simplify the 
notation a bit, we let c := Ci/c2 for ci, C2 > so that we 
have 



F(m,fc) = 



Cmi -|- 7712 
Cfcl -t- fc2 



> T 



(5) 



The effect of content on the response of nodes can eas- 
ily be inferred from ([5]): For instance, c < 1 (resp. c > 1) 
means that the current content is more likely to be pro- 
moted through type-2 edges (resp. type-1 edges). The 
special case c = 1 corresponds to the situations where 
both types of links have equal effect in spreading the 
content and the response function ([S|) reduces to that of 
a standard threshold model [l|. In the limit c — > (resp. 
c 00), we see that type-1 (resp. type-2) edges have 
no effect in spreading the content and the network H be- 
comes identical to a single network W (resp. F) for the 
purposes of the spread of this particular content. 



III. ANALYTIC RESULTS 
A. Condition and Probability of Global Cascades 

We start our analysis by deriving the condition and 
probability of global spreading events in overlay social- 
physical network H. In most existing works [l|, [28l - [30j , 
the possibility of a global spreading event hinges heavily 
on the existence of a percolating cluster of nodes whose 
state can be changed by only one active neighbor; these 
nodes are usually referred to as vulnerable. In other 
words, the condition for a global cascade to take place 
was shown to be equivalent to the existence of a giant 
vulnerable cluster in the network; i.e., fractional size of 
the largest vulnerable cluster being bounded away from 
zero in the asymptotic limit n — >■ cx). The probability of 
triggering a global cascade was then shown to be equal 
to the fractional size of the extended vulnerable cluster, 
which contains nodes that have links to at least one node 
in the giant vulnerable component. 

Here, we will show that the situation is more com- 
plicated unless the content parameter c is unity. The 
subtlety arises from the need for defining the notion of 
vulnerability with respect to (w.r.t.) two different neigh- 
borhood relationships. Namely, a node is said to be 
W- vulnerable (resp. F- vulnerable) , if its state can be 
changed by a single link in W (resp. in F) that connects 
it to an active node; a node is simply said to be vulnera- 
ble, if it is vulnerable w.r.t. at least one of the networks. 
Note that unless c = 1, a node can be F- vulnerable but 
not W-vulnerable, or vice versa. Therefore, an active 
vulnerable node does not necessarily activate all of its 
vulnerable neighbors, and the ordinary definition of a 
vulnerable component becomes vague. Here, we choose 
a natural definition of a vulnerable component in the fol- 
lowing manner: A set of nodes that are vulnerable w.r.t. 
at least one of the networks are said to form a vulner- 
able component if in the subgraph induced by this set of 
nodes, activating any node leads to the activation of all 
the nodes in the set. 

In fact, the above definition of a vulnerable component 
coincides with that of a strongly connected component 
[l^H^ in a directed graph. To see this, consider the sub- 
graph of vulnerable nodes in H. This subgraph forms a 
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directed network, where a (potentially bi-directional) F- 
link between nodes i and j will have the direction from i 
to j (resp. j to i) only if j (resp. i) is F-vulnerable; simi- 
lar definitions determine the directions of W-links. There 
exist several definitions for the components of a directed 
graph, but we use that give n by Boguiia and Serrano (40l | 
which is adopted from [39[. Namely, for a given vertex, 
its out-component is defined as the set of vertices that are 
reachable from it. Similarly, the in-component of a vertex 
is the set of nodes that can reach that vertex. Then, the 
giant out-component (GOUT) of a graph is defined as 
the set of nodes with infinite in-component, whereas the 
set of nodes that have infinite out-component defines the 
giant in-component (GIN). Finally, the giant strongly- 
connected component (GSCC) of the graph is given by 
the intersection of GIN and GOUT. By definition, any 
pair of nodes in the GSCC arc connected to each other 
via a directed path. 

The picture is now clear. According to the definition 
adopted here, the giant vulnerable component of the net- 
work corresponds to the GSCC of the subgraph induced 
by vulnerable nodes. Moreover, global cascades can take 
place if there exists a linear fraction of vulnerable nodes 
whose out-component is infinite; i.e., the global cascade 
condition corresponds to the appearance of GIN amongst 
the vulnerable nodes of H. Finally, the probability of 
triggering a global cascade will be given by the fractional 
size of the extended GIN (EGIN) , that contains GIN and 
vertices that are not vulnerable but, once activated, can 
activate a node in GIN. 

In principle, it is possible for a directed network to 
have GIN but no GSCC 4l[, raising the possibility of 



observing global cascades even when there is no giant 
vulnerable cluster in the networ k; this p ossibility would 
contradict the previous results [l|, I28l430l |. However, in all 
models that appeared in the literature to date j39ll40l . l4^ . 
it was shown that GIN, GOUT and GSCC appear simul- 
taneously in the network. In our case, since the condition 
and probability of global cascades can be obtained by an- 
alyzing only GIN (and EGIN) , we do not give an analysis 
to show the simultaneous appearance of GIN and GSCC; 
instead, this step is taken via simulations in Section llVI 
From here onwards, GSCC, GOUT and GIN refer to re- 
spective components of the vulnerable nodes in IHI even if 
it is not said so explicitly. 

We now turn to computing the probability (and condi- 
tion) of triggering a global cascade by finding the size of 
EGIN of vulnerable nodes in the network H. This will be 
done by considering a branching process which starts by 
activating an arbitrary node, and then recursively reveals 
the largest number of vulnerable nodes that are reached 
and activated by exploring its neighbors. Utilizing the 
standard approach on generating functions [2^ . l37j . we 
can then determine the condition for the existence of GIN 
as well as fractional size of EGIN; note that by definition 
EGIN exists if and only if GIN does. This approach is 
valid long as the initial stages of the branching process 
is locally tree-like, which holds in this case as the clus- 



tering coefficient of colored degree-driven networks scales 
like 1/n as n gets large [43j . 

Throughout, we use pk,i (resp. Pk,2) to denote the 
probability that a node is F-vulnerable (resp. W- 
vulnerable). In other words, pk,i (resp. Pk,2) is the 
probability that an inactive node with colored degree 
fe becomes active when it has only one active neigh- 
bor in F (resp. in W) and zero active neighbor in W 
(resp. in F). We also use Pk,in2 to denote the probabil- 
ity that a node with colored degree fe is both F-vulnerable 
and W- vulnerable. In the same manner, we use Pfc,in2=, 
Pfc,i=n2, and Pk,iT\2'', to denote the probabilities that a 
node is F-vulnerable but not W- vulnerable, W- vulnerable 
but not F-vulnerable, and neither F-vulnerable nor W- 
vulncrable, respectively. More precisely, we set 



Pfc,in2 
Pfc,i=n2 



> T and — > r 



cki -)- k2 
c 



cki + /s^ 



< T and 



cki -f k2 
1 

cki -\- k2 



> T 



Similar relations define pk.i: Pk.2 Pk.i''n2, and pk,i''n2'=- 
It is clear that if c = 1, pk i = pk 2 = Pk,in2, whereas 
Pfc,i'=n2 = Pfc,in2^ = 0. 

We now solve for the survival probability of the afore- 
mentioned branching process by using the mean-field ap- 
proach based on the generating functions [2^, [s^] . Let 
gi{x) (resp. 52 (a;)) denote the generating functions for 
the finite number of nodes reached by following a type- 
1 (resp. type-2) edge in the above branching process. 
The generating functions gi (x) and 32 (x) satisfy the self- 
consistency equations 



gi{x) ^xY^ 



kiPk 
< ki> 



Pk.i-gi{xY'-'g2{xY' (6) 



< fci > 



92{X 



k2Pk 
<k2> 



Pk,2-9i{xf'92{xf'~' (7) 



, oY^ ^2Pk X 



The validity of © can be seen as follows: The explicit 
factor X accounts for the initial vertex that is arrived at. 
The factor kipk / < ki > gives the normalized probability 
that an edge of type 1 is attached (at the other end) to 
a vertex with colored degree fe. Since the arrived node is 
reached by a type-1 link, it needs to be F-vulnerable to be 
added to the vulnerable component. If the arrived node 
is indeed F-vulnerablc (happens with probability pk.i) it 
can activate other nodes via its remaining ki — I edges of 
type-1 and ^2 edges of type-2. Since the number of vul- 
nerable nodes reached by each of its type-1 (resp. type-2) 
links is generated in turn by gi{x) (resp. 52 (a^)) we ob- 



tain the term gi{x)'^^ 
generating functions 



by the powers property of 
Averaging over all possible 



colored degrees k gives the first term in The second 
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term with the factor x° = 1 accounts for the possibihty 
that the arrived node is not F- vulnerable and thus is not 
included in the cluster. The relation ^ can be validated 
via similar arguments. 

Using the relations (jH])-©, we now find the finite num- 
ber of vulnerable nodes reached and activated by the 
above branching process. With G{x) denoting the corre- 
sponding generating function, we get 



G(x) = 



k 



(8) 



Similar to ©-([T]), the relation ^ can be seen as follows: 
The factor x corresponds to the initial node that is se- 
lected arbitrarily and made active. The selected node 
has colored degree fc = (fci, fc2) with probability pk- The 
number of vulnerable nodes that are reached by each of 
its ki (resp. fc2) branches of type 1 (resp. type 2) is 
generated by gi(x) (resp. g2{x)). This yields the term 
gi{x)''^ g2{xY^ and averaging over all possible colored de- 
grees, we get dS]). 

We are interested in the solution of the recursive rela- 
tions ©-(O for the case x — 1. This case exhibits a triv- 
ial fixed point gi{\) = 52(1) — 1 which yields G{1) — 1 
meaning that the underlying branching process is in the 
subcritical regime and that all components have finite 
size as understood from the conservation of probability. 
However, the fixed point .gi(l) = 52(1) = 1 corresponds 
to the physical solution only if it is an attractor, i.e., a 
stable solution to the recursion (H])-©. The stability of 
this fixed point can be checked via linearization of (E))-© 
around gi{l) = 52(1) = 1, which yields the Jacobian Jp 
given by 



Jp — 



<(fci -fci)pfc,i > 
<fei> 

<fclfc2Pfc,2> 
<fe2> 



<fclfc2Pfc,l > 
<fci> 

<(fc^-fc2)pfc,2> 
</C2> 



(9) 



If all the eigenvalues of Jp are less than one in absolute 
value (i.e., if the spectral radius cr{Jp) of J is less than or 
equal to unity), then the solution 171(1) = g2(l) = 1 is sta- 
ble and G(l) = 1 becomes the physical solution, meaning 
that with high probability GIN does not exist. In that 
case, a global spreading event is not possible and the 
fraction S of infiuenced individuals always tends to zero. 
However, if the spectral radius of Jp is larger than unity, 
then another solution with 31(1), 52(1) < 1 becomes the 
attractor of ([SI)-© yielding a solution with G(l) < 1. 
In that case, global cascades are possible meaning that 
switching the state of an arbitrary node gives rise to a 
global spreading event with positive probability, Ptrig- 
In fact, the deficit 1 — G(l) corresponds to the probabil- 
ity that an arbitrary node, once activated, activates an 
infinite number of vulnerable nodes, which in turn corre- 
sponds to the probability of triggering a global cascade; 
i.e., we have 



We close this section by noting that G{x) corresponds 
to the size of the extended component EGIN, not GIN; 
i.e., 1 — G(l) gives the asymptotic size of EGIN as a frac- 
tion of the number of nodes n. This is because, in we 
have ignored the possibility of the initial node being not 
vulnerable; this makes sense since, in the cascade process, 
the initially selected node is forced to be active regardless 
of its state of vulnerability. In order to obtain the size 
of GIN, one should consider another generating function 
H{x) that is given by multiplying © with the proba- 
bility (1 — pfc,i<:n2<:) that the initial node is vulnerable 
and adding the term x'^^y.pkPk.i^ri2'=- The asymptotic 
size of GIN (as a fraction of n) would then be given by 
l-H{l). 



B. Expected Cascade Size 

We now compute expected final size of a global cas- 
cade when it occurs. Namely, we will derive the asymp- 
totic fraction of individuals that eventually become active 
when an arbitrary node is switched to active state. Our 
analysis is based on the work by Gleeson and Cahalane 
[4^ 1 and Gleeson [l^ who derived the expected final size 
of global spreading events on a wide range of networks. 
Their approach, which is built on the tools developed for 
analyzing the zero-temperature random-field Ising model 
on Bcthe lattic es |45l. is also adopted by several other au- 
thors; e.g., see [llEi, HO, HI . 

The discussion starts with the following basic observa- 
tion: If the network structure is locally tree-like (which 
holds here as noted before [i^), then we can replace EI 
by a tree structure where at the top level, there is a sin- 
gle node say with colored degree k = (fci, fc2)- In other 
words, the top node is connected to ki nodes via Face- 
book links and k2 nodes via physical links at the next 
lower level of the tree. Each of these fci (resp. k2) nodes 



(resp. 



have degree k' — ^"11 '"2; "^"^ i^i^i^^uii-i^j <ki> 
with probability ) , and they are in turn connected 



Pi 



trig 



l-G(l). 



to k'l — 1 (resp. k'^ ) nodes via Facebook links and ^3 (resp. 
^2 — 1) nodes via physical links at the next lower level of 
the tree; the minus one terms are due to the links that 
connect the nodes to their parent at the upper level. 

In the manner outlined above, we label the levels of 
the tree from ^ = at the bottom to ^ -> c« at the top 
of the tree. Without loss of generality, we assume that 
nodes update their states starting from the bottom of 
the tree and proceeding towards the top. In other words, 
we assume that a node at level £ updates its state only 
after all nodes at the lower levels 0, 1, . . . — 1 finish 
updating. Now, define qi^i (resp. q2j) as the probability 
that a node at level (. of the tree, which is connected to 
its unique parent by a type-1 link (resp. a type-2 link), 
is active given that its parent at level £ -I- 1 is inactive. 
Then, consider a node at level £ + \ that is connected 
to its parent at level ^ -I- 2 by a typc-1 link. This node 
has degree k = (fci,fc2) with probability -^^^ and the 
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probability that i of its Faccbook neighbors and j of its 
physical network connections are active is given by 



The minus one term on ki accommodates the fact that 
the parent of the node under consideration is inactive 
by the assumption that nodes update their states only 
after all the nodes at the lower levels finish updating. 
Further, the probability that a node becomes active when 
i of its ki Facebook connections, and j of its /c2 physical 
connections are active is given by 



k = (fci,fc2) 



by the definition of neighborhood response function 
F{Tn,k). 

Arguments similar to the above one leads to analogous 
relations for nodes that are connected to their unique 
parents by a type-2 link. Combining these, and averaging 
over all possible degrees and all possible active neighbor 
combinations, we arrive at the recursive relations 



< fci > 

k i=Q j=0 



x(i - qi,i) 



12,1+1 



k2Pk 



^ < fc2 > 
fc ^ i=0 j=0 



(12) 



x(l-gM)'^-^(^'^. ^yi/l^q2,,r^-'-\ 

for each £ = 0, 1, . . .. Under the assumption that nodes 
do not become inactive once they turn active, the quan- 
tities qi^i and q2,i are non-decreasing in i and thus, they 
converge to a limit (?i,oo and (j2,oo- Further, the final frac- 
tion of active individuals 5* is equal (in expected value) 
to the probability that the node at the top of the tree 
becomes active. Thus, we conclude that 



fci fc: 
k i=Q j=0 



ki 



Pk}^l^F{{z,j),k)[ : ql^{l~qi.^r 



\ki —i 



x(^^.)qiooi^-q2.^f'-'- (13) 

Under the natural condition F{{0,0),k) ~ 0, qi,oo ~ 
92,00 = is the trivial fixed point of the recursive equa- 
tions ([TT|) - (|12p . In view of ([T^. this trivial solution yields 
S' = pointing out the non-existence of global spreading 
events. However, the trivial fixed point may not be sta- 
ble and another solution with (71,00,92,00 > may exist. 
In fact, the condition for the existence of a non-trivial 
solution can be obtained by checking the stability of the 
trivial fixed point via linearization at qu = 52, f = 0. The 



entries of the corresponding Jacobian matrix is given 
by 



Js = 



<(fci-fci)Pfca> <fcifc2Pfc.2> 

<fcl> <fel> 

<fclfc2Pfc,l> <(kl—k2)ph,2> 



<k2> 



<k2> 



By direct inspection, it is easy to see that the spectral 
radius of Js is equal to that of the matrix Jp defined in 
(l9|); this follows from the facts that Js{i,i) = Jp{i,i) for 
i = 1,2 and Js{l,2)-Js{2,l) = Jp(l, 2) • Jp(2, 1). Hence, 
as would be expected, we find that the recursive relations 
(fTT|l - (fT2|) give the same global cascade condition (namely, 
cr(Jp) > 1) as the recursive relations ©-([T]) obtained 
through utilizing generating functions. Nevertheless, the 
generating functions approach is useful in its own right 
as it enables quantifying the probability /trig of global 
cascades. 



IV. NUMERICAL RESULTS 

We now illustrate our findings by numerical simula- 
tions. In our first example we consider n = 5 x 10^ nodes 
in the physical network W and assume that only half of 
these nodes are members of the network F; i.e., we set 
a ~ 0.5. Following the references [H, [13, [H, we fix the 
thresholds at r = r"^ = 0.18 and assume that both F and 
W are Erdos-Renyi networks [s^ with mean degrees zi 
and Z2, respectively. Figure [T] shows the fractional size of 
the giant vulnerable cluster Sy, the triggering probabil- 
ity Ptrig, and the expected cascade size S w.r.t. zi = zi 
for two different contents. For the first content C\ we 
assume that c = 0.25, meaning that type-2 links are four 
times as important as type-1 links in spreading this con- 



tent, and plot the corresponding results in Figure 1(a) 
For the second content C2, we assume that both types 
of links are equivalent w.r.t. spreading the content, i.e., 
c = 1, and show the results in Figure 



1(b) In all eases. 



lines correspond to our analysis results from Eqs. ([S]) 
and (|13p . whereas symbols are obtained from computer 
simulations: For each parameter set, we generated inde- 
pendent realizations of the graphs F and W, and then 
observed the cascade process over the graph H upon ac- 
tivating an arbitrary node. The size iS'tj of the giant vul- 
nerable component is computed by finding the GSCC of 
the directed graph induced by the vulnerable nodes as 
described in Section IIII Al The results are given by av- 
eraging over 100 (resp. 5,000) independent runs for 5"^ 
and S (resp. Ptng)- 

Figure [1] leads to a number of interesting observations. 
First, we see an excellent agreement between the analyti- 
cal results and simulations, confirming the validity of our 
analysis; the discrepancy near the upper phase transition 
is due to finite size effect. Second, we see how content 
might impact the dynamics of complex contagions over 
the same network. For content C\ we see that global cas- 
cades are possible when 1.0 < Z\ = Z2 < 4.9, whereas 
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FIG. 1. (Color online) The fractional size of the giant vulnerable cluster, Sv, the final cascade size S and the triggering 
probability Ptrig are plotted for n = 5 x 10^, a = 0.5, r is fixed at r* = 0.18, and F and W are Erdos-Renyi networks with 
mean degrees z\ and Z2, respectively. The content parameter is taken to be (a) c = 0.25 and (b) c = 1. The lines correspond 
to the analytical results obtained from Eqs. (O) and ijl^l) . wfiereas symbols represent experimental results averaged over 100 
independent realizations for Sv and S, and over 5, 000 realizations for Ptrig- (Insets) The same plots at a higher resolution on 
Zi = Z2 near the lower phase transition. 



C2 can spread globally only if 0.7 < = 22 < 3.9. We 
also see that on the range where global cascades are pos- 
sible for both contents (namely 1.0 < zi = Z2 < 3.9), 
the probability of them taking place can still differ sig- 
nificantly; e.g., if zi = Z2 = 3, we have Ptrig = 0.84 for 
Ci, while for C2 we have Ptrig = 0.65. Finally, simula- 
tions confirm the simultaneous appearance of the GSCC 
and GIN in the subgraph of vulnerable nodes in IH as un- 
derstood from the identical parameter ranges that give 
positiv e val ues for S, Ptrig and Sy; see also the Inset of 
Figure 1 (a) Therefore, the possibility of observing global 
cascades without a giant vulnerable cluster is ruled out 
in our model, although this possibility exists in general. 

For a better demonstration of the effect of content 
on the probability and size of global cascades, wc now 
consider a different experimental set-up. This time, for 
three different cases, we fix all the parameters except the 
content-parameter c, and observe the variation of Ptrig 
and S with respect to c. The results are depicted in 
Figure [2] In all cases, we set n = 5 x 10^, a = 0.5, 
r = T* = 0.18, and assume that F and W arc Erdos- 
Renyi networks with mean degrees zi and Z2, respec- 
tively. In Figure 2(a) wc consider the case zi = 1.5 
and Z2 ~ 5.5, and see that global cascades are possible 
only for < c < 0.27, and c > 3.22, but no global cas- 
cade can take place in the range 0.28 < c < 3.21. This 
can be explained as follows: When c is too small, the 
spreading of the content is governed solely by network 
W (with average degree Z2 = 5.5), on which large global 
cascades arc possible with very low probability; in other 
words network W is close to the upper phase transition 



threshold. As c gets larger, the effect of F— links becomes 
considerable, and the connectivity of the overall network 
(with respect to the spread of the current content) in- 
creases. This eventually causes global cascades to disap- 
pear due to the high local stability (i.e., connectivity) of 
the nodes. However, further increase in c shifts the bias 
towards F— links, and due to lower average degree in F, 
this causes a decrease in the local stability of the nodes 
and brings global cascades back to the existence. 

In Figure 2(b)[ we set zi = Z2 = 0.7. This time, we see 
an exactly opposite dependence of cascade sizes on the 
parameter c. Namely, global cascades are not possible 
when c is too small or too large, but they do take place 
in the interval 0.5 < c < 2.5. This is because, under 
the current setting, both networks F and W have limited 
connectivity, so that global cascades do not take place 
in either of the networks separately. Therefore, if c is 
too small (resp. too large), only W (resp. F) can spread 
the content and all triggering events have finite size as 
confirmed in the plot by the non-existence of global cas- 
cades for c < 0.4 and c > 2.6. But, for c relatively close 
to unity, the two networks spread the content collabora- 
tively, yielding a high enough connectivity in the overall 
network H to achieve a positive probability of global cas- 
cades. 

The situation is somewhat different in the case of Fig- 
ure 2(c) where we have zi = 6.0 and Z2 = 1.5: For con- 
tents that mainly spread over W-links, i.e., for c close 
to zero, global spreading events take place with posi- 
tive probability since network W (with average degree 
Z2 = 1.5) satisfies the global spreading condition. How- 
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FIG. 2. (Color online) We see tie variation of cascade probability Ptrig and cascade size S with respect to the content parameter 
c, when a = 0.5, t — t* — 0.18, and ¥ and W are Erdos-Renyi networks with mean degrees zi and Z2, respectively. We set (o) 

Zl = 1.5, 22 ~ 5.5, (6) 2l = 22 = 0.7, (c) 2l = 6.0, 22 = 1.5. 



ever, as c gets larger, the high average degree in network 
F (and, thus the high local stability of the nodes) makes 
it harder for the content to spread in the overall network 
H, eventually causing the probability of global cascades 
drop to zero. This is confirmed in Figure 2(c) as we see 
that global cascades take place only for contents with 
c < 0.5 and any content with c > 0.6 dies-out before 
reaching a non-trivial fraction of the network. 

We continue our simulation study by depicting the 
variation of the cascade window with respect to content 
parameter c in Figure 3(a) We see that for each t*, 



the parameter c changes the range of zi = 22 for which 
global cascades can occur in a non-trivial way. For in- 
stance, none of the three regions cover one another. In 
fact, the cascade window for c = 4 is contained in that of 
c = 1 for most of the r* values, but, with zi = Z2 = 4 and 
T* in (0.17,0.18) cascades do not occur for c = 1 while 
they do for c = 4. We also sec that the content parameter 
c can effect the maximum threshold t* for which global 
cascades are possible. When c — 1 and c = 0.1, cascades 
can take place for t* < 0.25, whereas the upper-bound 
is reduced to 0.23 for c — A. 

Finally, we test our theory for networks which are not 
locally tree like. In fact, most real networks arc known 
[47! to exhibit a phenomenon often called clustering (or 
transitivity), informally defined as the propensity of a 
node's neighbors to be neighbors as well. Since our the- 
ory is developed for networks that do not have clustering, 
we do not expect our results to provide good estimations 
for clustered networks; in the case of Watts' threshold 
model, it is already shown [ll] that clustering can have a 
significant impact on the size of global cascades. Never- 
theless, we would like to provide the first step in showing 
the effect of clustering on content-dependent cascading 
processes in multiplex networks. 

To this end, we generate random clustered networks 
F and W as prescribed by Newman [4^ and Miller 
[49} . Namely, we consider distributions p{( and p™j 
that give the probability of a node being connected to 



s single edges and t triangles; conventional degree dis- 
tributions are then given by p(. = ^ p{fSk,s+2t and 
Pk ~ J2s tPft^k.s+2t- For convenience, we consider a 
doubly Poisson distribution for pst', namely, we set 



/ 

Pst 



(^1/2)* 
tl 



s,t = 0,1,. 



(14) 



and define p™^ similarly with zi replaced by Z2- Notice 
that average degrees are now given by 2zi and 2z2 in 
networks F and W, respectively. With n = 10^, a = 0.5, 
T — T* = 0.18 and c = 0.5, wc show in Figure [3(b)| the 
variation of the cascade size S with respect to average 
degrees 2zi = 2z2- As before, the line corresponds to the 
analytical solution (obtained from Eq. ([T3| ). whereas 
symbols are obtained from simulations by averaging over 
50 experiments for each point. Also, in the Inset of Fig- 
ure 3(b)[ we plot the average clustering coefficient [4^ 
observed for networks W and F; with zi = Z2 both net- 
works have (statistically) identical clustering coefficients. 

As expected, we do not see a good match between the 
predictions of our analysis (zero-clustering) and the ac- 
tual cascade size from experiments (positive clustering). 
However, these results agree with the double faceted pic- 
ture drawn in [sl] for the effect of clustering on cascade 
sizes in Watts' model: When average degrees are small, 
clustering decreases the expected size of global cascades, 
whereas after a certain value of average degree, clustering 
increases the expected cascade size. 



V. CONCLUSION 

We have determined the condition, probability and the 
size of global cascades in random networks with classi- 
fied links. This is done under a new contagion model 
where nodes switch state when their perceived (content- 
dependent) proportion of active neighbors exceed a cer- 
tain threshold. Our results highlight the effect of con- 
tent and link classification in the characteristics of global 
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FIG. 3. (Color online) (a) We see the cascade windows for several c values when a — 0.5, r = t* , and ¥ and W are Erdds-Renyi 
networks with mean degrees z\ and Z2, respectively. In other words, the lines enclose the region of the (r*, zi =22) plane for 
which the global cascade condition (j{Jp) > 1 is satisfied; outside the boundary, we have (j{Jp) < 1 and global cascades can not 
take place, (b) Global cascade size S when F and W are random clustered networks with degree distributions given by i)i4D . 
We take n = IC, a = 0.5, t = r* = 0.18 and c = 0.5. The line corresponds to analytical prediction from i|J3D . wiereas symbols 
are obtained from simulations by averaging over 50 independent realizations. (Inset) Average clustering coefficient versus 
2zi = 2z2. 



cascades, and show how different content may have dif- 
ferent spreading characteristics over the same network. 
Also, the resuhs given here extend the existing work on 
complex contagions to multiple overlay networks whose 
vertices are not disjoint. 

Our findings also contain some of the existing results 
as special cases. For instance, our results may be applied 
to a wide range of processes on the network H by ap- 
propriately selecting the neighborhood response function 
F{m, k). In particular, the general results of Section Hill 
include the solutions for bond percolation, and simple 
contagion [s^ processes by setting p^.i = = T for 
some transmissibility T in [0,1]. The threshold model 
of Watts [l| is also covered by our theory by setting the 
content parameter c to unity in all cases. 

We believe that the results presented here give some 
interesting insights into the cascade processes in complex 
networks. In particular, our results might help better un- 
derstand such processes and may lead to more efiicient 
control of them. Controlling cascade processes is partic- 
ularly relevant when dealing with cascading failures in 
interdependent structures as well as when marketing a 



certain consumer product. Finally, the formulation pre- 
sented here opens many new questions in the field. For 
instance, the dynamics of cascade processes are yet to be 
investigated on clustered or degree-correlated networks 
under the content-dependent threshold model introduced 
here. Another challenging problem would be to formalize 
the results given in this paper without using a mean-field 
approach; in fact, very recently Lelarge and co-workers 
[51I [52! have obtained rigorous results for the condition 
and size of global cascades in Watts' threshold model. 
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